Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 63
Filtrar
1.
J Agric Food Chem ; 72(14): 8060-8071, 2024 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-38533667

RESUMO

Smoke taint in wine has become a critical issue in the wine industry due to its significant negative impact on wine quality. Data-driven approaches including univariate analysis and predictive modeling are applied to a data set containing concentrations of 20 VOCs in 48 grape samples and 56 corresponding wine samples with a taster-evaluated smoke taint index. The resulting models for predicting the smoke taint index of wines are highly predictive when using as inputs VOC concentrations after log conversion in both grapes and wines (Pearson Correlation Coefficient PCC = 0.82; R2 = 0.68) and less so when only grape VOCs are used (Pearson Correlation Coefficient PCC = 0.76; R2 = 0.56), and the classification models also show the capacity for detecting smoke-tainted wines using both wine and grape VOC concentrations (Recall = 0.76; Precision = 0.92; F1 = 0.82) or using only grape VOC concentrations (Recall = 0.74; Precision = 0.92; F1 = 0.80). The performance of the predictive model shows the possibility of predicting the smoke taint index of the wine and grape samples before fermentation. The corresponding code of data analysis and predictive modeling of smoke taint in wine is available in the Github repository (https://github.com/IBPA/smoke_taint_prediction).


Assuntos
Vitis , Compostos Orgânicos Voláteis , Vinho , Vinho/análise , Compostos Orgânicos Voláteis/análise , Fumaça/análise , Frutas/química , Tabaco
2.
J Autoimmun ; 143: 103167, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38301504

RESUMO

IL-23-activation of IL-17 producing T cells is involved in many rheumatic diseases. Herein, we investigate the role of IL-23 in the activation of myeloid cell subsets that contribute to skin inflammation in mice and man. IL-23 gene transfer in WT, IL-23RGFP reporter mice and subsequent analysis with spectral cytometry show that IL-23 regulates early innate immune events by inducing the expansion of a myeloid MDL1+CD11b+Ly6G+ population that dictates epidermal hyperplasia, acanthosis, and parakeratosis; hallmark pathologic features of psoriasis. Genetic ablation of MDL-1, a major PU.1 transcriptional target during myeloid differentiation exclusively expressed in myeloid cells, completely prevents IL-23-pathology. Moreover, we show that IL-23-induced myeloid subsets are also capable of producing IL-17A and IL-23R+MDL1+ cells are present in the involved skin of psoriasis patients and gene expression correlations between IL-23 and MDL-1 have been validated in multiple patient cohorts. Collectively, our data demonstrate a novel role of IL-23 in MDL-1-myelopoiesis that is responsible for skin inflammation and related pathologies. Our data open a new avenue of investigations regarding the role of IL-23 in the activation of myeloid immunoreceptors and their role in autoimmunity.


Assuntos
Artrite Psoriásica , Dermatite , Psoríase , Humanos , Artrite Psoriásica/patologia , Interleucina-17/genética , Interleucina-17/metabolismo , Neutrófilos/metabolismo , Pele/patologia , Dermatite/patologia , Inflamação , Interleucina-23/genética , Interleucina-23/metabolismo , Receptores de Superfície Celular/metabolismo , Lectinas Tipo C/genética
3.
Biomedicines ; 11(8)2023 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-37626757

RESUMO

INTRODUCTION: Biogenic amines play important roles throughout cellular metabolism. This study explores a role of biogenic amines in glioblastoma pathogenesis. Here, we characterize the plasma levels of biogenic amines in glioblastoma patients undergoing standard-of-care treatment. METHODS: We examined 138 plasma samples from 36 patients with isocitrate dehydrogenase (IDH) wild-type glioblastoma at multiple stages of treatment. Untargeted gas chromatography-time of flight mass spectrometry (GC-TOF MS) was used to measure metabolite levels. Machine learning approaches were then used to develop a predictive tool based on these datasets. RESULTS: Surgery was associated with increased levels of 12 metabolites and decreased levels of 11 metabolites. Chemoradiation was associated with increased levels of three metabolites and decreased levels of three other metabolites. Ensemble learning models, specifically random forest (RF) and AdaBoost (AB), accurately classified treatment phases with high accuracy (RF: 0.81 ± 0.04, AB: 0.78 ± 0.05). The metabolites sorbitol and N-methylisoleucine were identified as important predictive features and confirmed via SHAP. CONCLUSION: To our knowledge, this is the first study to describe plasma biogenic amine signatures throughout the treatment of patients with glioblastoma. A larger study is needed to confirm these results with hopes of developing a diagnostic algorithm.

4.
Curr Res Food Sci ; 6: 100500, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37151381

RESUMO

The information on nutritional profile of cooked foods is important to both food manufacturers and consumers, and a major challenge to obtaining precise information is the inherent variation in composition across biological samples of any given raw ingredient. The ideal solution would address precision and generability, but the current solutions are limited in their capabilities; analytical methods are too costly to scale, retention-factor based methods are scalable but approximate, and kinetic models are bespoke to a food and nutrient. We provide an alternate solution that predicts the micronutrient profile in cooked food from the raw food composition, and for multiple foods. The prediction model is trained on an existing food composition dataset and has a 31% lower error on average (across all foods, processes and nutrients) than predictions obtained using the baseline method of retention-factors. Our results argue that data scaling and transformation prior to training the models is important to mitigate any yield bias. This study demonstrates the potential of machine learning methods over current solutions, and additionally provides guidance for the future generation of food composition data, specifically for sampling approach, data quality checks, and data representation standards.

5.
PLoS One ; 17(11): e0278121, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36449508

RESUMO

Tax audits are a crucial process adopted in all tax departments to ensure tax compliance and fairness. Traditionally, tax audit leads have been selected based on empirical rules and randomization methods, which are not adaptive, may miss major cases and can introduce bias. Here, we present an audit lead tool based on artificial neural networks that have been trained and evaluated on an integrated dataset of 93,413 unique tax records from 8,647 restaurant businesses over 10 years in the Northern California, provided by the California Department of Tax and Fee Administration (CDTFA). The tool achieved a 40.1% precision and 58.7% recall (F1-score of 0.42) on classifying positive audit leads, and the corresponding regressor provided estimated audit gains (MAE of $155,490). Finally, we evaluated the statistical significance of various empirical rules for use in lead selection, with two out of five being supported by the data. This work demonstrates how data can be leveraged for creating evidence-based models of audit selection and validating empirical hypotheses, resulting in higher audit yields and more fair audit selection processes.


Assuntos
Comércio , Redes Neurais de Computação , Rememoração Mental , Restaurantes
6.
J Agric Food Chem ; 70(48): 15007-15027, 2022 Dec 07.
Artigo em Inglês | MEDLINE | ID: mdl-36409321

RESUMO

Research continues to provide compelling insights into potential health benefits associated with diets rich in plant-based natural products (PBNPs). Coupled with evidence from dietary intervention trials, dietary recommendations increasingly include higher intakes of PBNPs. In addition to health benefits, PBNPs can drive flavor and sensory perceptions in foods and beverages. Chardonnay marc (pomace) is a byproduct of winemaking obtained after fruit pressing that has not undergone fermentation. Recent research has revealed that PBNP diversity within Chardonnay marc has potential relevance to human health and desirable sensory attributes in food and beverage products. This review explores the potential of Chardonnay marc as a valuable new PBNP ingredient in the food system by combining health, sensory, and environmental sustainability benefits that serves as a model for development of future ingredients within a sustainable circular bioeconomy. This includes a discussion on the potential role of computational methods, including artificial intelligence (AI), in accelerating research and development required to discover and commercialize this new source of PBNPs.


Assuntos
Inteligência Artificial , Indústria Alimentícia , Humanos
7.
J Am Med Inform Assoc ; 30(1): 38-45, 2022 12 13.
Artigo em Inglês | MEDLINE | ID: mdl-36308771

RESUMO

OBJECTIVE: A hallmark of personalized medicine and nutrition is to identify effective treatment plans at the individual level. Lifestyle interventions (LIs), from diet to exercise, can have a significant effect over time, especially in the case of food intolerances and allergies. The large set of candidate interventions, make it difficult to evaluate which intervention plan would be more favorable for any given individual. In this study, we aimed to develop a method for rapid identification of favorable LIs for a given individual. MATERIALS AND METHODS: We have developed a method, algorithmic lifestyle optimization (ALO), for rapid identification of effective LIs. At its core, a group testing algorithm identifies the effectiveness of each intervention efficiently, within the context of its pertinent group. RESULTS: Evaluations on synthetic and real data show that ALO is robust to noise, data size, and data heterogeneity. Compared to the standard of practice techniques, such as the standard elimination diet (SED), it identifies the effective LIs 58.9%-68.4% faster when used to discover an individual's food intolerances and allergies to 19-56 foods. DISCUSSION: ALO achieves its superior performance by: (1) grouping multiple LIs together optimally from prior statistics, and (2) adapting the groupings of LIs from the individual's subsequent responses. Future extensions to ALO should enable incorporating nutritional constraints. CONCLUSION: ALO provides a new approach for the discovery of effective interventions in nutrition and medicine, leading to better intervention plans faster and with less inconvenience to the patient compared to SED.


Assuntos
Hipersensibilidade , Estilo de Vida , Humanos , Dieta , Exercício Físico , Medicina de Precisão
8.
Sci Rep ; 12(1): 14489, 2022 08 25.
Artigo em Inglês | MEDLINE | ID: mdl-36008537

RESUMO

The aim of this study was to derive a model to predict the risk of dogs developing chronic kidney disease (CKD) using data from electronic health records (EHR) collected during routine veterinary practice. Data from 57,402 dogs were included in the study. Two thirds of the EHRs were used to build the model, which included feature selection and identification of the optimal neural network type and architecture. The remaining unseen EHRs were used to evaluate model performance. The final model was a recurrent neural network with 6 features (creatinine, blood urea nitrogen, urine specific gravity, urine protein, weight, age). Identifying CKD at the time of diagnosis, the model displayed a sensitivity of 91.4% and a specificity of 97.2%. When predicting future risk of CKD, model sensitivity was 68.8% at 1 year, and 44.8% 2 years before diagnosis. Positive predictive value (PPV) varied between 15 and 23% and was influenced by the age of the patient, while the negative predictive value remained above 99% under all tested conditions. While the modest PPV limits its use as a stand-alone diagnostic screening tool, high specificity and NPV make the model particularly effective at identifying patients that will not go on to develop CKD.


Assuntos
Laboratórios Clínicos , Insuficiência Renal Crônica , Animais , Nitrogênio da Ureia Sanguínea , Creatinina , Cães , Valor Preditivo dos Testes , Insuficiência Renal Crônica/diagnóstico , Insuficiência Renal Crônica/veterinária
9.
Comput Struct Biotechnol J ; 20: 3473-3481, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35860406

RESUMO

The rational design of T Cell Receptors (TCRs) for immunotherapy has stagnated due to a limited understanding of the dynamic physiochemical features of the TCR that elicit an immunogenic response. The physiochemical features of the TCR-peptide major histocompatibility complex (pMHC) bond dictate bond lifetime which, in turn, correlates with immunogenicity. Here, we: i) characterize the force-dependent dissociation kinetics of the bond between a TCR and a set of pMHC ligands using Steered Molecular Dynamics (SMD); and ii) implement a machine learning algorithm to identify which physiochemical features of the TCR govern dissociation kinetics. Our results demonstrate that the total number of hydrogen bonds between the CDR2ß-MHC⍺(ß), CDR1α-Peptide, and CDR3ß-Peptide are critical features that determine bond lifetime.

10.
Int J Mol Sci ; 23(11)2022 May 26.
Artigo em Inglês | MEDLINE | ID: mdl-35682665

RESUMO

Microorganisms often live in complex habitats, where changes in the environment are predictable, providing an opportunity for microorganisms to learn, anticipate the upcoming environmental changes and prepare in advance for better survival and growth. One such environment is the mammalian intestine, where the abundance of different carbon sources is spatially distributed. In this study, we identified seven spatially distributed carbon sources in the mammalian intestine and tested whether Escherichia coli exhibits phenotypes that are consistent with an anticipatory response given their spatial order and abundance within the mammalian intestine. Through RNA-Seq and RT-PCR validation measurements, we found that there was a 67% match in the expression patterns between the measured phenotypes and what would otherwise be expected in the case of anticipatory behavior, while 83% and 0% were in agreement with the homeostatic and random response, respectively. To understand the genetic and phenotypic basis of the discrepancies between the expected and measured anticipatory responses, we thoroughly investigated the discrepancy in D-galactose treatment and the expression of maltose operon in E. coli. Here, the expected anticipatory response, based on the spatial distribution of D-galactose and D-maltose, was that D-galactose should upregulate the maltose operon, but it was the opposite in experimental validation. We performed whole genome random mutagenesis and screening and identified E. coli strains with positive expression of maltose operon in D-galactose. Targeted Sanger sequencing and mutation repair identified that the mutations in the promoter region of malT and in the coding region of the crp gene were the factors responsible for the reversion in the association. Further, to identify why positive association in the D-galactose treatment and the expression of the maltose operon did not evolve naturally, fitness measurements were performed. Fitness experiments demonstrated that the fitness of E. coli strains with a positive association in the D-galactose treatment and the expression of the maltose operon was 12% to 20% lower than that of the wild type strain.


Assuntos
Escherichia coli , Maltose , Carbono/metabolismo , Escherichia coli/metabolismo , Galactose/metabolismo , Maltose/genética , Maltose/metabolismo , Mutação , Óperon/genética
11.
Nat Commun ; 13(1): 2360, 2022 04 29.
Artigo em Inglês | MEDLINE | ID: mdl-35487919

RESUMO

We present a machine learning framework to automate knowledge discovery through knowledge graph construction, inconsistency resolution, and iterative link prediction. By incorporating knowledge from 10 publicly available sources, we construct an Escherichia coli antibiotic resistance knowledge graph with 651,758 triples from 23 triple types after resolving 236 sets of inconsistencies. Iteratively applying link prediction to this graph and wet-lab validation of the generated hypotheses reveal 15 antibiotic resistant E. coli genes, with 6 of them never associated with antibiotic resistance for any microbe. Iterative link prediction leads to a performance improvement and more findings. The probability of positive findings highly correlates with experimentally validated findings (R2 = 0.94). We also identify 5 homologs in Salmonella enterica that are all validated to confer resistance to antibiotics. This work demonstrates how evidence-driven decisions are a step toward automating knowledge discovery with high confidence and accelerated pace, thereby substituting traditional time-consuming and expensive methods.


Assuntos
Infecções por Escherichia coli , Salmonella enterica , Antibacterianos/farmacologia , Farmacorresistência Bacteriana/genética , Escherichia coli/genética , Humanos , Salmonella enterica/genética
12.
Metab Eng ; 69: 50-58, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-34763090

RESUMO

Previously, Escherichia coli was engineered to produce isobutyl acetate (IBA). Titers greater than the toxicity threshold (3 g/L) were achieved by using layer-assisted production. To avoid this costly and complex method, adaptive laboratory evolution (ALE) was applied to E. coli for improved IBA tolerance. Over 37 rounds of selective pressure, 22 IBA-tolerant mutants were isolated. Remarkably, these mutants not only tolerate high IBA concentrations, they also produce higher IBA titers. Using whole-genome sequencing followed by CRISPR/Cas9 mediated genome editing, the mutations (SNPs in metH, rho and deletion of arcA) that confer improved tolerance and higher titers were elucidated. The improved IBA titers in the evolved mutants were a result of an increased supply of acetyl-CoA and altered transcriptional machinery. Without the use of phase separation, a strain capable of 3.2-fold greater IBA production than the parent strain was constructed by combing select beneficial mutations. These results highlight the impact improved tolerance has on the production capability of a biosynthetic system.


Assuntos
Proteínas de Escherichia coli , Escherichia coli , Acetatos , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Laboratórios
13.
Front Microbiol ; 12: 680553, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34248896

RESUMO

Glutaraldehyde is a widely used biocide on the market for about 50 years. Despite its broad application, several reports on the emergence of bacterial resistance, and occasional outbreaks caused by poorly disinfection, there is a gap of knowledge on the bacterial adaptation, tolerance, and resistance mechanisms to glutaraldehyde. Here, we analyze the effects of the independent selection of mutations in the transcriptional regulator yqhC for biological replicates of Escherichia coli cells subjected to adaptive laboratory evolution (ALE) in the presence of glutaraldehyde. The evolved strains showed improved survival in the biocide (11-26% increase in fitness) as a result of mutations in the activator yqhC, which led to the overexpression of the yqhD aldehyde reductase gene by 8 to over 30-fold (3.1-5.2 log2FC range). The protective effect was exclusive to yqhD as other aldehyde reductase genes of E. coli, such as yahK, ybbO, yghA, and ahr did not offer protection against the biocide. We describe a novel mechanism of tolerance to glutaraldehyde based on the activation of the aldehyde reductase YqhD by YqhC and bring attention to the potential for the selection of such tolerance mechanism outside the laboratory, given the existence of YqhD homologs in various pathogenic and opportunistic bacterial species.

15.
Front Microbiol ; 12: 640923, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33717036

RESUMO

Biocide use is essential and ubiquitous, exposing microbes to sub-inhibitory concentrations of antiseptics, disinfectants, and preservatives. This can lead to the emergence of biocide resistance, and more importantly, potential cross-resistance to antibiotics, although the degree, frequency, and mechanisms that give rise to this phenomenon are still unclear. Here, we systematically performed adaptive laboratory evolution of the gut bacteria Escherichia coli in the presence of sub-inhibitory, constant concentrations of ten widespread biocides. Our results show that 17 out of 40 evolved strains (43%) also decreased the susceptibility to medically relevant antibiotics. Through whole-genome sequencing, we identified mutations related to multidrug efflux proteins (mdfA and acrR), porins (envZ and ompR), and RNA polymerase (rpoA and rpoBC), as mechanisms behind the resulting (cross)resistance. We also report an association of several genes (yeaW, pyrE, yqhC, aes, pgpA, and yeeP-isrC) and specific mutations that induce cross-resistance, verified through mutation repairs. A greater capacity for biofilm formation with respect to the parent strain was also a common feature in 11 out of 17 (65%) cross-resistant strains. Evolution in the biocides chlorophene, benzalkonium chloride, glutaraldehyde, and chlorhexidine had the most impact in antibiotic susceptibility, while hydrogen peroxide and povidone-iodine the least. No cross-resistance to antibiotics was observed for isopropanol, ethanol, sodium hypochlorite, and peracetic acid. This work reinforces the link between exposure to biocides and the potential for cross-resistance to antibiotics, presents evidence on the underlying mechanisms of action, and provides a prioritized list of biocides that are of greater concern for public safety from the perspective of antibiotic resistance. SIGNIFICANCE STATEMENT: Bacterial resistance and decreased susceptibility to antimicrobials is of utmost concern. There is evidence that improper biocide (antiseptic and disinfectant) use and discard may select for bacteria cross-resistant to antibiotics. Understanding the cross-resistance emergence and the risks associated with each of those chemicals is relevant for proper applications and recommendations. Our work establishes that not all biocides are equal when it comes to their risk of inducing antibiotic resistance; it provides evidence on the mechanisms of cross-resistance and a risk assessment of the biocides concerning antibiotic resistance under residual sub-inhibitory concentrations.

16.
Clin Nutr ; 40(6): 4414-4421, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33504454

RESUMO

OBJECTIVE: Identification of microbiota-based biomarkers as predictors of low-FODMAP diet response and design of a diet recommendation strategy for IBS patients. DESIGN: We created a compendium of gut microbiome and disease severity data before and after a low-FODMAP diet treatment from published studies followed by unified data processing, statistical analysis and predictive modeling. We employed data-driven methods that solely rely on the compendium data, as well as hypothesis-driven methods that focus on methane and short chain fatty acid (SCFA) metabolism pathways that were implicated in the disease etiology. RESULTS: The patient's response to a low-FODMAP diet was predictable using their pre-diet fecal samples with F1 accuracy scores of 0.750 and 0.875 achieved through data-driven and hypothesis-driven predictors, respectively. The fecal microbiome of patients with high response had higher abundance of methane and SCFA metabolism pathways compared to patients with no response (p-values < 6 × 10-3). The genera Ruminococcus 1, Ruminococcaceae UCG-002 and Anaerostipes can be used as predictive biomarkers of diet response. Furthermore, the low-FODMAP diet followers were identifiable given their microbiome data (F1-score of 0.656). CONCLUSION: Our integrated data analysis results argue that there are two types of patients, those with high colonic methane and SCFA production, who will respond well on a low-FODMAP diet, and all others, who would benefit a dietary supplementation containing butyrate and propionate, as well as probiotics with SCFA-producing bacteria, such as lactobacillus. This work demonstrates how data integration can lead to novel discoveries and paves the way towards personalized diet recommendations for IBS.


Assuntos
Dieta com Restrição de Carboidratos , Ácidos Graxos Voláteis/metabolismo , Microbioma Gastrointestinal , Síndrome do Intestino Irritável/dietoterapia , Metano/metabolismo , Bactérias/metabolismo , Fezes/microbiologia , Humanos , Síndrome do Intestino Irritável/metabolismo , Síndrome do Intestino Irritável/microbiologia , Redes e Vias Metabólicas , Metagenoma
17.
Clin Exp Rheumatol ; 39(3): 508-518, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-32662400

RESUMO

OBJECTIVES: Prediction and determination of drug efficacy for radiographic progression is limited by the heterogeneity inherent in axial spondyloarthritis (axSpA). We investigated whether unbiased clustering analysis of phenotypic data can lead to coherent subgroups of axSpA patients with a distinct risk of radiographic progression. METHODS: A group of 412 patients with axSpA was clustered in an unbiased way using a agglomerative hierarchical clustering method, based on their phenotype mapping. We used a generalised linear model, naïve Bayes, Decision Trees, K-Nearest-Neighbors, and Support Vector Machines to construct a consensus classification method. Radiographic progression over 2 years was assessed using the modified Stoke Ankylosing Spondylitis Spine Score (mSASSS). RESULTS: axSpA patients were classified into three distinct subgroups with distinct clinical characteristics. Sex, smoking, HLA-B27, baseline mSASSS, uveitis, and peripheral arthritis were the key features that were found to stratifying the phenogroups. The three phenogroups showed distinct differences in radiographic progression rate (p<0.05) and the proportion of progressors (p<0.001). Phenogroup 2, consisting of male smokers, had the worst radiographic progression, while phenogroup 3, exclusively suffering from uveitis, showed the least radiographic progression. The axSpA phenogroup classification, including its ability to stratify risk, was successfully replicated in an independent validation group. CONCLUSIONS: Phenotype mapping results in a clinically relevant classification of axSpA that is applicable for risk stratification. Novel coupling between phenotypic features and radiographic progression can provide a glimpse into the mechanisms underlying divergent and shared features of axSpA.


Assuntos
Espondilartrite , Espondilite Anquilosante , Teorema de Bayes , Humanos , Aprendizado de Máquina , Masculino , Coluna Vertebral , Espondilartrite/diagnóstico por imagem
18.
Foods ; 10(1)2020 Dec 22.
Artigo em Inglês | MEDLINE | ID: mdl-33374916

RESUMO

Interest in animal cell-based meat (ACBM) or laboratory-grown meat has been increasing; however, the economic viability of these potential products has not been thoroughly vetted. Recent studies suggest monoclonal antibody production technology can be adapted for the industrialization of ACBM production. This study provides a scenario-based assessment of the projected cost per kilogram of ACBM produced in the United States based on cellular metabolic requirements and process/chemical engineering conventions. A sensitivity analysis of the model identified the nine most influential cost factors for ACBM production out of 67 initial parameters. The results indicate that technological performance will need to approach technical limits for ACBM to achieve profitably as a commodity. However, the model also suggests that low-volume high-value specialty products could be viable based on current technology.

19.
Nat Commun ; 11(1): 5026, 2020 10 06.
Artigo em Inglês | MEDLINE | ID: mdl-33024104

RESUMO

How to design experiments that accelerate knowledge discovery on complex biological landscapes remains a tantalizing question. We present an optimal experimental design method (coined OPEX) to identify informative omics experiments using machine learning models for both experimental space exploration and model training. OPEX-guided exploration of Escherichia coli's populations exposed to biocide and antibiotic combinations lead to more accurate predictive models of gene expression with 44% less data. Analysis of the proposed experiments shows that broad exploration of the experimental space followed by fine-tuning emerges as the optimal strategy. Additionally, analysis of the experimental data reveals 29 cases of cross-stress protection and 4 cases of cross-stress vulnerability. Further validation reveals the central role of chaperones, stress response proteins and transport pumps in cross-stress exposure. This work demonstrates how active learning can be used to guide omics data collection for training predictive models, making evidence-driven decisions and accelerating knowledge discovery in life sciences.


Assuntos
Biologia Computacional/métodos , Escherichia coli/efeitos dos fármacos , Escherichia coli/genética , Modelos Biológicos , Antibacterianos/farmacologia , Proteínas de Bactérias/genética , Desinfetantes/farmacologia , Regulação Bacteriana da Expressão Gênica/efeitos dos fármacos , Aprendizado de Máquina , Proteínas de Membrana/genética , Chaperonas Moleculares/genética , Projetos de Pesquisa , Estresse Fisiológico/efeitos dos fármacos , Estresse Fisiológico/genética
20.
Biochemistry ; 59(40): 3834-3843, 2020 10 13.
Artigo em Inglês | MEDLINE | ID: mdl-32935984

RESUMO

To complement established rational and evolutionary protein design approaches, significant efforts are being made to utilize computational modeling and the diversity of naturally occurring protein sequences. Here, we combine structural biology, genomic mining, and computational modeling to identify structural features critical to aldehyde deformylating oxygenases (ADOs), an enzyme family that has significant implications in synthetic biology and chemoenzymatic synthesis. Through these efforts, we discovered latent ADO-like function across the ferritin-like superfamily in various species of Bacteria and Archaea. We created a machine learning model that uses protein structural features to discriminate ADO-like activity. Computational enzyme design tools were then utilized to introduce ADO-like activity into the small subunit of Escherichia coli class I ribonucleotide reductase. The integrated approach of genomic mining, structural biology, molecular modeling, and machine learning has the potential to be utilized for rapid discovery and modulation of functions across enzyme families.


Assuntos
Alcanos/metabolismo , Bactérias/enzimologia , Proteínas de Bactérias/metabolismo , Ferritinas/metabolismo , Engenharia de Proteínas , Aldeídos/metabolismo , Bactérias/química , Bactérias/genética , Bactérias/metabolismo , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , Ferritinas/química , Ferritinas/genética , Genes Bacterianos , Modelos Moleculares , Oxigenases/química , Oxigenases/genética , Oxigenases/metabolismo , Conformação Proteica , Ribonucleotídeo Redutases/química , Ribonucleotídeo Redutases/genética , Ribonucleotídeo Redutases/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...